79 research outputs found

    API Fluency


    Refining code ownership with synchronous changes

    When mining software repositories, two distinct sources of information are usually explored: the history log and snapshots of the system. Results of analyses derived from these two sources are biased by the frequency with which developers commit their changes. We argue that the usage of mainstream SCM (software configuration management) systems influences the way developers work. For example, since it is tedious to resolve conflicts caused by parallel commits, developers tend to minimize conflicts by not modifying the same file at the same time. This, however, defeats one of the purposes of such systems. We mine repositories created by our tool Syde, which records changes in a central repository whenever a file is compiled locally in the IDE (integrated development environment) by any developer in a multi-developer project. This new source of information can improve the accuracy of analyses and breaks new ground in terms of how such information can assist developers. We illustrate how the information we mine provides a refined notion of code ownership with respect to the one inferred from SCM system data. We demonstrate our approach on three case studies, including an industrial one. Ownership models suffer from the assumption that developers have a perfect memory. To account for their imperfect memory, we integrate into our ownership measurement a model of memory retention, to simulate the effect of memory loss over time. We evaluate the characteristics of this model for several strengths of memory.
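
    The retention function itself is not reproduced in the abstract; as a rough sketch, ownership can be measured by weighting each recorded change with an exponentially decaying memory factor, so older changes count less toward a developer's share. The exponential form, the half-life parameter, and the toy change events below are assumptions for illustration, not the model or data from the paper.

```python
from collections import defaultdict
from datetime import datetime
from math import exp, log

def ownership_shares(changes, now, half_life_days=30.0):
    """Per-developer ownership of a file from (developer, timestamp) change
    events, with each change weighted by an exponentially decaying
    memory-retention factor (an assumed stand-in for the paper's model)."""
    decay = log(2) / half_life_days
    weight = defaultdict(float)
    for developer, timestamp in changes:
        age_days = (now - timestamp).total_seconds() / 86400.0
        weight[developer] += exp(-decay * age_days)
    total = sum(weight.values()) or 1.0
    return {dev: w / total for dev, w in weight.items()}

# Toy change events, standing in for the fine-grained changes Syde records
# on every local compile (hypothetical data).
changes = [
    ("alice", datetime(2024, 1, 5)),
    ("alice", datetime(2024, 1, 20)),
    ("bob", datetime(2024, 3, 1)),
]
print(ownership_shares(changes, now=datetime(2024, 3, 10)))
```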

    Evaluating defect prediction approaches: a benchmark and an extensive comparison

    Reliably predicting software defects is one of the holy grails of software engineering. Researchers have devised and implemented a plethora of defect/bug prediction approaches varying in terms of accuracy, complexity and the input data they require. However, the absence of an established benchmark makes it hard, if not impossible, to compare approaches. We present a benchmark for defect prediction, in the form of a publicly available dataset consisting of several software systems, and provide an extensive comparison of well-known bug prediction approaches, together with novel approaches we devised. We evaluate the performance of the approaches using different performance indicators: classification of entities as defect-prone or not, and ranking of the entities, with and without taking into account the effort to review an entity. We performed three sets of experiments aimed at (1) comparing the approaches across different systems, (2) testing whether the differences in performance are statistically significant, and (3) investigating the stability of approaches across different learners. Our results indicate that, while some approaches perform better than others in a statistically significant manner, external validity in defect prediction is still an open problem, as generalizing results to different contexts/learners proved to be a partially unsuccessful endeavor.
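
    The benchmark's exact indicators are not listed in the abstract; one way to see the difference between plain classification and effort-aware ranking is the sketch below, which ranks entities by predicted defect-proneness per line of code and measures how many defective entities fall within a fixed review budget. The metric, field names, and sample data are illustrative assumptions, not the benchmark's own definitions.

```python
def effort_aware_recall(entities, budget_fraction=0.2):
    """Fraction of defective entities caught when reviewing entities in order
    of predicted defect-proneness per LOC, within a fixed LOC budget
    (a simplified, hypothetical effort-aware indicator)."""
    ranked = sorted(entities, key=lambda e: e["score"] / max(e["loc"], 1), reverse=True)
    budget = budget_fraction * sum(e["loc"] for e in entities)
    reviewed_loc, found = 0, 0
    for e in ranked:
        if reviewed_loc + e["loc"] > budget:
            break
        reviewed_loc += e["loc"]
        found += e["defective"]
    total_defective = sum(e["defective"] for e in entities)
    return found / total_defective if total_defective else 0.0

# Hypothetical entities: model score, size in LOC, and ground-truth label.
entities = [
    {"score": 0.9, "loc": 120, "defective": 1},
    {"score": 0.7, "loc": 900, "defective": 1},
    {"score": 0.4, "loc": 80,  "defective": 0},
    {"score": 0.2, "loc": 300, "defective": 0},
]
print(effort_aware_recall(entities))  # 0.5: one of two defects found in budget
```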

    On how often code is cloned across repositories


    On porting software visualization tools to the web

    Software systems are hard to understand due to the complexity and the sheer size of the data to be analyzed. Software visualization tools are a great help, as they can sum up large quantities of data in dense, meaningful pictures. Traditionally, such tools come in the form of desktop applications. Modern web frameworks are about to change this status quo, as building software visualization tools as web applications can help in making them available to a larger audience in a collaborative setting. Such a migration comes with a number of promises, perils, and technical implications that must be considered before starting any migration process. In this paper, we share our experiences in porting two such tools to the web and provide guidelines about the porting. In particular, we discuss promises and perils that go hand in hand with such an endeavor and present a number of technological alternatives that are available to implement web-based visualizations.

    Big Code != Big Vocabulary: Open-Vocabulary Models for Source Code

    Statistical language modeling techniques have successfully been applied to large source code corpora, yielding a variety of new software development tools, such as tools for code suggestion, improving readability, and API migration. A major issue with these techniques is that code introduces new vocabulary at a far higher rate than natural language, as new identifier names proliferate. Both large vocabularies and out-of-vocabulary issues severely affect Neural Language Models (NLMs) of source code, degrading their performance and rendering them unable to scale. In this paper, we address this issue by: 1) studying how various modelling choices impact the resulting vocabulary on a large-scale corpus of 13,362 projects; 2) presenting an open vocabulary source code NLM that can scale to such a corpus, 100 times larger than in previous work; and 3) showing that such models outperform the state of the art on three distinct code corpora (Java, C, Python). To our knowledge, these are the largest NLMs for code that have been reported. All datasets, code, and trained models used in this work are publicly available. Comment: 13 pages; to appear in Proceedings of ICSE 2020.
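
    The open-vocabulary model rests on subword units learned with byte-pair encoding, so rare identifiers decompose into known pieces instead of becoming out-of-vocabulary tokens. The sketch below is a minimal BPE merge-learning loop over a toy identifier corpus; the corpus and merge count are made up, and the real pipeline operates at a far larger scale than this illustration.

```python
from collections import Counter

def learn_bpe_merges(identifiers, num_merges=8):
    """Learn byte-pair-encoding merges from a corpus of identifiers.
    Each identifier starts as a sequence of characters plus an end marker;
    the most frequent adjacent pair is merged repeatedly, so unseen
    identifiers can later be segmented into known subwords."""
    vocab = Counter(tuple(t) + ("</w>",) for t in identifiers)
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for word, freq in vocab.items():
            for a, b in zip(word, word[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        merged_vocab = Counter()
        for word, freq in vocab.items():
            out, i = [], 0
            while i < len(word):
                if i < len(word) - 1 and (word[i], word[i + 1]) == best:
                    out.append(word[i] + word[i + 1])
                    i += 2
                else:
                    out.append(word[i])
                    i += 1
            merged_vocab[tuple(out)] += freq
        vocab = merged_vocab
    return merges

# Toy identifier corpus: frequent pieces like "et" and "Count" emerge, so a
# new identifier such as "getRetryCount" still maps onto known subwords.
corpus = ["getCount", "getName", "setCount", "getValue", "resetCount"]
print(learn_bpe_merges(corpus))
```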

    DIE: A Domain Specific Aspect Language for IDE Events

    Integrated development environments (IDEs) have become the primary way to develop software. Beyond using the built-in features, it becomes more and more important to be able to extend the IDE with new features and extensions. Plugin architectures exist, but they show weaknesses related to unanticipated extensions and event handling. In this paper, we argue that a more general solution for extending IDEs is needed. We present and discuss a solution, motivated by a set of concrete examples: a domain specific aspect language for IDE events. In it, join points are events of interest that may trigger the advice in which the behavior of the IDE extension is called. We show how this allows for the development of IDE plugins and demonstrate the advantages over traditional publish/subscribe systems.
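
    DIE's own syntax and semantics are not shown in the abstract; the sketch below only mimics the general idea in Python: IDE events act as join points, a pointcut is a pattern over event names, and a matching event triggers the registered advice, so an extension can react to whole families of events without subscribing to each one individually. The class, event names, and API here are hypothetical.

```python
import fnmatch

class EventAspectWeaver:
    """Aspect-style dispatch of IDE events: a pointcut is a glob pattern over
    event names (the join points), and the advice runs whenever a matching
    event is emitted. This mimics the general idea behind DIE, not the
    language's actual syntax or semantics."""

    def __init__(self):
        self.aspects = []  # (pointcut pattern, advice) pairs

    def on(self, pointcut):
        def register(advice):
            self.aspects.append((pointcut, advice))
            return advice
        return register

    def emit(self, event_name, **context):
        for pattern, advice in self.aspects:
            if fnmatch.fnmatch(event_name, pattern):
                advice(event_name, context)

ide = EventAspectWeaver()

@ide.on("editor.file.*")  # one pointcut covers a whole family of join points
def log_file_events(event, ctx):
    print(f"[plugin] {event}: {ctx.get('path')}")

ide.emit("editor.file.saved", path="Main.java")   # triggers the advice
ide.emit("debugger.breakpoint.hit", line=42)      # no matching pointcut
```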